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Abstract. Using the multiplicities of the Laplace eigenspace on the 
sphere (the space of spherical harmonics) we endow the space with 
Gaussian probability measure. This induces a notion of random Gauss- 
ian spherical harmonics of degree n having Laplace eigenvalue E = 
n(n+ 1). We study the length distribution of the nodal lines of random 
spherical harmonics. 

It is known that the expected length is of order n. It is natural to 
conjecture that the variance should be of order n, due to the natural 
scaling. Our principal result is that, due to an unexpected cancelation, 
the variance of the nodal length of random spherical harmonics is of 
order logn. This behaviour is consistent with the one predicted by 
Berry for nodal lines on chaotic billiards (Random Wave Model). In 
addition we find that a similar result is applicable for "generic" linear 
statistics of the nodal lines. 



1. Introduction 

Nodal patterns (first described by Ernest Chladni in 18th century) appear 
in many problems in engineering, physics and natural sciences: they describe 
the sets that remain stationary during vibrations. Hence, their importance 
in such diverse areas as musical instruments, mechanical structures, earth- 
quake study and other areas. They also arise in the study of wave propaga- 
tion and in astrophysics; this is a very active and rapidly developing research 
area. Let (Ai,g) be a compact manifold and / : Ai — > R be a real valued 
function. The nodal set of / is its zero set / -1 (0) = {x G Ai : f(x) = 0}. 

The most important or fundamental case is that of / being the eigenfunc- 
tion of the Laplace-Beltrami operator on Ai 

(1) A g f + Ef = 0, 

with E > 0. In this case it is known [8], that generically, the nodal sets 
are smooth submanifolds of Ai of codimension 1. For example, if Ai is a 
surface, the nodal sets are smooth curves, also called the nodal lines. One 
is interested in studying their volume (i.e. the length of the nodal line for 
the 2-dimensional case) and other properties for highly excited eigenstates. 
Yau conjectured |25|, [26] that the volume of the nodal set is commensurable 
to \[E in the sense that there exist constants cm , Cm > such that if / 
satisfies (P) then 

(2) c m Ve < Voltf-^O)) < C M VE. 
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The lower bound was proved by Bruning and Gromes [7] and Bruning [6] for 
the planar case. Donnelly and Fefferman |10| finally settled Yau's conjecture 
for real analytic metrics. However, the general case of a smooth manifold is 
still open. 

1.1. Spherical Harmonics. In this paper, we will concentrate on the nodal 
sets on the sphere. It is well known that the eigenvalues E of the Laplace 
equation 

A/ + Ef = 

on the m-dimensional sphere S m are all the numbers of the form 

(3) E™ = n{n + m- 1), 

where n is an integer. Given a number E™, the corresponding eigenspace is 
the space £™ of spherical harmonics of degree n. Its dimension is given by 

(4) TV = TV™ = 2n + m " 1 ( n + m ~ 1 U r^n-\ 
w n n + m-1 V m-1 J (m-l)\ 

Given an integral number n, we fix an L 2 (S m ) orthonormal basis of £™ 

giving an identification £™ = Wt^™ . For further reading on the spherical 
harmonics we refer the reader to [1], chapter 9. 

1.2. Random model. We consider a random eigenfunction 

V fc=i 

where are standard Gaussian N(0, 1) i.i.d. That is, we use the identifi- 
cation 

to endow the space £™ with Gaussian probability measure v as 

dv(r) _ 4w "fai---%j 

where a = (aj) G R^"" are as in ([5]). 

Note that v is invariant with respect to the orthonormal basis for £™. 
Moreover, the Gaussian random field f™ is isotropic in the sense that for 
every x\, . . . x\ G S m and every orthogonal R G 0(m + 1), 

(6) (URxi), URxi)) = (fn(xi), . . . , f n (xi)) . 

As usual, for any random variable X, we denote its expectation EX. For 
example, with the normalization factor in ([5|), for every m > 2, n and fixed 
point x G 5™, one has 

cm n 

(7) cwi^EW^. 

y " fc=i 

a simple corollary from the addition theorem (see [Tj, or f)18|) for m = 2). 
Any characteristic X(L) of the nodal set 

L = L(/™) = {* G S 2 : = 0} 
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is a random variable. The most natural characteristic of the nodal set Lfm 
of /™ is, of course, its (m — l)-dimensional volume Z(f™). The main goal 
of the present paper is the study of the distribution of the random variable 
Z(f™) for a random Gaussian f n £ £ ri . 

1.3. Some Conventions. Throughout the paper, the letters x,y will de- 
note either points on the sphere S m or spherical variables. For x, y E S m , 
d(x, y) will stand for the spherical distance between x and y. Given a set 
F C S m , we denote its area by \F\; len(C) will stand for the length of a 
smooth curve C C S m . For example, 



I5 2 | = 4tt, 



and 



Z(/ 2 ) = len({/ 2 (x) = 0}). 

In this paper we are mainly concerned with the 2-dimensional case. There- 
fore, to simplify the notation we will use f n (%) '■= 7n( x )> an d accordingly 
£ n := £l E n := El N n := Ml r% := r,f . 

In this manuscript, we will use the notations A <C B and A = 0{B) 
interchangeably. If necessary, the constant involved will depend on the pa- 
rameters written in the subscript. For example, O v or means that the 
constants involved depend on the function ip. 

1.4. Nodal length and related subjects. It is widely believed that for 
generic chaotic billiards, one can model the nodal lines for eigenfunctions 
of eigenvalue of order ~ E with nodal lines of isotropic, monochromatic 
random waves of wavenumber \[E (this is called Berry's Random Wave 
Model or RWM). Berry |3j found that the expected length (per unit area) 
of the nodal lines for the RWM is of size approximately \^E, and he argued 
that the variance should be of order log E. 

Berard [2] proved that for every m > 2, 
(8) E[Z(f™)]=c m . v fEf, 
where 

27T m/2 



^ v^r(f)' 

(see also [15] and [23]). Furthermore, Neuheisel [15] established an asymp- 
totic upper bound for the variance of the form 

/ iprn \ / pi? 

(9) Va4Z(0) = — 2-y )=0< • 



(m-^l) 2 I \ , m - 1 

n 3m + l / VA/ 3m + 1 

and in our previous work |23j . we improved the latter to be 

Var(Z(C)) = O 



Either of the bounds implies that the variance of the length, normalized so 
that its expected value is 1, vanishes with prescribed rate, 
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for the latter bound. This means that the constants Cg2 and C$2 guaranteed 
by Donnely-Fefferman ([2]) may be taken as essentially equal for "generic" 
eigenfunctions f™ E £™, where n is large. 

The volume of the nodal line of a random eigenfunction on the torus 

T 2 = R 2 /Z 2 

was studied by Rudnick and Wigman [16J and subsequently by Krishnapur 
and Wigman [14J. In this case, it is not difficult to see that the expectation 
is again M[Z(f' )] = const ■ yE. Their principal result is that as the 
eigenspace dimension J\f grows to infinity, the variance is bounded by 

and it is likely that it is asymptotic to Var(^(/' 7 " )) ~ *™ for a "generic" 
sequence of eigenvalues. 

For generic manifolds, one does not expect the Laplacian to have any 
multiplicities, so that we cannot introduce a Gaussian ensemble on the 
eigenspace. Let Ej be the eigenvalues and (pj the corresponding eigenfunc- 
tions. It is well known that the Ej are discrete, Ej — > oo and L 2 (A4) = 
span{4>j}. 

In this case, rather than considering random eigenfunctions, one consid- 
ers random combinations of eigenfunctions with growing energy window of 
either type 

f L (x)= ajcj>j{x) 
£?,e[o,E] 

(called the long range), or 

f S ( x ) = Yl a j4>j(x), 
y/Eje[VE,VE+i] 

(called the short range), as E — > oo. Berard [2] and Zelditch [27] found that 

EZ{f L ) ~C M -VE 
and recently Zelditch [27] proved that 

EZ(f) ~C m -Ve, 

notably with the same constant Cm > for both the long and the short 
ranges. 

For billiards (i.e. surfaces with piecewise smooth boundary), one is inter- 
ested in the number of intersections of the nodal line with the boundary, or, 
equivalently, the number of open nodal components. Toth and Wigman [21] 
studied the number of boundary intersections for random combinations of 
eigenfunctions f L (x) and f s (x) on generic billiards, defined precisely as 
above. They found that the expected number of the intersections is of order 

Ve. 

In the first part of this paper, we resolve the high energy asymptotic 
behaviour for the variance of the nodal length for random 2-dimensional 
spherical harmonics 

/„ = fl : S 2 -> M. 
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Theorem 1.1. One has 

(10) Var(Z(/ n )) = -logn + 0(l), 

asymptotically as n — ^ oo. 

For the higher dimensional sphere S m C ]R m+1 with m > 3, it is possible 
to prove |21] that 

Var( Z (O = (^)=0(^), 

and it is likely that 

Var(^(C)) ~ ~^~ 2 

for some constant c > 0. We intend to address the question of precise 
asymptotics for the higher dimensional case in the future. 

1.5. Smooth linear statistics. 2 

Rather than considering the volume of the full nodal line one may choose 
a nice submanifold F C S m of the sphere and consider the nodal volume 

Z F (f?) := Vol«C = 0} H F). 
inside F. More generally, let ip : S m ->Mbea function. One then defines 

Z^C) ■= J v(x)dVol f ^- Ho) (x). 
/^ _1 (0) 

The random variable Z^(f^) is called a (smooth) linear statistic of the 
nodal set. A priori, this definition makes sense only for continuous test func- 
tion ip G C(S m ), so that the restriction ¥?|^m-i( ) G C(/™ _:L (0)) is defined. 
Unfortunately, the class C(S 2 ) of continuous functions does not contain 
the characteristic functions of smooth sets. However, it is known [13] that 
for a smooth (m — l)-dimensional hypersurface C one can define the trace 
trc(<^) £ L l {C) of ip for some wider classes of functions such as W l,1 {S m ), the 
class of integrable functions with integrable weak derivatives, even though 
the values of ip G W 1 ' 1 (S m ) are defined up to measure zero spherical sets. 
To define the trace, one exploits the values of ip in a small tubular neigh- 
bourhood of C. 

Unfortunately again, the class W 1,1 does not contains the family of char- 
acteristic functions of nice sets. As an example, let us consider the 2- 
dimensional spherical disc F = B(N, |) CS 2 centered at the north pole of 
radius f , C = dF its boundary, and (p = xf- Then the definition of t^ciXF) 
is ambiguous since one may define it as either G L l {C) or 1 G L^iC). 
This phenomenon (i.e. the jump in / occurring precisely on C) is typical to 
the class BV{S m ) of functions of bounded variation; it is known [T3], that 
for any characteristic function \F of a submanifold F with C 2 boundary, 
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Xf £ BV(S m ), and, in addition, W^ l {S m ) C BV{S m ). It turns out that, 
despite this subtlety, one can still extend the notion of average trace 

to the full class p G BV(S m ) (see Appendix O for more details). For in- 
stance, in our previous example, trg F (xF) = \- It is then natural to define 

/^ _1 (o) 

It is easy to compute the expected value of a "generic" linear statistic, 
following along the lines of the proof of [23], Proposition 1.4, starting from 

dm!). 

Lemma 1.2. For 

p g W(5 2 )nL°°(5 2 ) 

we have 

f p{x)dx 

(11) E[Z*(f n )]= s2 23/2 

Remark 1.3. Note that /„ is odd or even if n is odd or even respectively, 
so that in particular the nodal lines are symmetric w.r.t. the involution 
x i — y —x. Therefore if (p is odd then Z v (f n ) vanishes identically in either 
case. Moreover, 

ZV(f n ) = Z*> ev (f n ), 

where the even part of p 

p™{x) := ^ {X) 

does not vanish identically, if and only if, p is not odd. Therefore, we may 
assume that <p is even in the first place, and we will assume so throughout 
the rest of this paper. 

Under the assumption of continuous differentiability we have the following 
result for the variance of Z^ for 2-dimensional spherical harmonics. 

Theorem 1.4. Let p : S 2 — )• R be a continuously differentiable even func- 
tion, which does not vanish identically. Then as n — )• oo 

(12) Var(^(/ n )) = c{<p) • logn + MoojV{tp) (l), 
where 

II l|2 

(13) c(p) := 65^gp > 0, 

i.e. the constant involved in the "O "-notation depends only on the L°° norm 
Halloo an d the total variation V(p) of p, and moreover, this dependency is 
monotone increasing. 

Unfortunately, Theorem 11.41 does not cover the characteristic functions 
of nice submanifolds. For this, we have Theorem 11.51 the main idea of its 
proof is approximating a function p G BV(S 2 ) with C°° functions pi, for 
which we apply Theorem II. 4[ We control the error term in (|12p applied to 
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Pi using its L°° norm and variation, which is why we included this technical 
statement in the formulation of Theorem 11.41 in the first place. 

Theorem 1.5. Let 

ip G BV(S 2 )nL°°(S 2 ) 
be a not identically vanishing even function. Then osn->oo 

(14) Var(^(/ n )) = c{<p) • logn + O v (l), 
where 

The characteristic function xf of a subsurface FC5 2 with C 2 boundary 
is of bounded variation i.e. xf £ BV(S 2 ), [13J Example 1.4. Therefore, 
in this case the statement of Theorem 11.51 is valid for Z F , as the following 
corollary states. 

Corollary 1.6. Let FC5 2 be a subsurface of the sphere with C 2 boundary. 
Then as n — > oo 

(15) Var(Z F (/„)) = c.logn + F (l), 

Remark 1.7. One may observe from the proof of Theorem 11.51 that the 
constant involved in the "O" -notation in f|X4j) depends only on ||</?||oo an d 
the total variation V(ip). In particular, the constant involved in the "de- 
notation (|15p depends only on the length of the boundary dF. 

1.6. Discussion. 

1.6.1. "Berry's Cancelation Phenomenon". Originally, it was conjectured 
that the variance Var(iJ(/„)) should be asymptotic to c ■ n, where c > 
is a constant, due to the natural scaling; however, it turned out that c 
vanishes, precisely as predicted by Berry [3J for the RWM. The reason for 
this phenomenon, which we refer to as "Berry's cancelation phenomenon", 
is that the leading nonconstant term in the long range asymptotics of the 
2-point correlation function is purely oscillating (see the Key Proposition 
13. 5|) . so that it does not contribute to the variance. The non-oscillating 
leading terms cancel (which is, according to Michael Berry [3J, "obscure"). 
It seems that "Berry's cancelation phenomenon" is of general nature: it also 
occurs on the torus p3], and it is likely to hold for random combinations of 
eigenfunctions on a generic manifold [22J. 



1.6.2. Spherical Harmonics vs. RWM. The principal result of the present 
paper shows that the behaviour of the nodal lines of 2-dimensional spherical 
harmonics of eigenvalue E is consistent with the RWM of wavenumber \fE, 
predicted for nodal lines of generic chaotic systems. In both cases, the 
expected nodal length is of order y/~E and variance of order logi^. More 
precisely, Berry [3] argued that for a billiard of area A, the variance of the 
nodal length should be asymptotic to 

logE 

512vr s 
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in the high energy limit. Taking into account the symmetry of the nodal lines 
on S 2 , its "effective" area is 2tt, and therefore, according to the RWM, the 
variance should be asymptotic to log E, which differs from the statement 
of Theorem II. II by a constant. 

There is a direct relation between the random spherical harmonics and 
the RWM. Kolmogorov's theorem implies that a random centered Gaussian 
ensemble of functions is determined by its covariance function (see section 
12.11). The covariance function for the RWM is 



trwm{x, y) = J (Ve\x - 

x,y 6 M 2 , and for the random spherical harmonics is 

r n (x,y) = P n (cos(d(x,y))), 

where P n are the Legendre polynomials. The Legendre polynomials admit 
Hilb's asymptotics 



P„(cos(0)) « J ^- Jo (<Kn + 1/2)), 
y sm0 

i.e. almost identical to RWM, up to the "correction factor" yj This 
factor seem to "know" about the geometry of the sphere; it is one of the 
underlying factors responsible for the difference in the constants in the vari- 
ance asymptotics. The geometry of the sphere occurs in some other places 
as well. 



1.6.3. Nodal Set vs. Level Sets. Interestingly, the behaviour of the level 
curves /~ 1 (L) for L > is very different. Let Z L (f n ) be the length of the 
level curve of f n . The expected length is [24j 

nZ L (f n )} = Cl e- L2 / 2 ^E~ 

consistent with the nodal case L = 0. However, unlike the nodal lines, the 
variance of the level curves length is asymptotic to [23] 

Var(Z L (/ n )) ~ c 2 L 4 e- L2 • n. 

1.6.4. Real vs. Complex Zeros. 

The behaviour of the zeros of complex analytic functions was studied ex- 
tensively in the recent years and it is interesting to learn that their behaviour 
is very different from our case of real valued spherical harmonics. Sodin and 
Tsirelson [20j considered 3 different models of random complex analytic 
functions tpi, : M — > C, all parametrized by an integer L — > oo, roughly 
corresponding to the degree of the harmonic polynomials n. Here Ai is the 
natural domain corresponding to the model with G-invariant measure m*, 
where G is a group of symmetries; Ai is either the sphere C U {oo}, the 
complex plain C or the unit disc {\z\ < 1}. In this case the set of zeros is 
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almost surely finite. The authors establish the asymptotic Gaussianity for 
smooth linear statistics h : M — > R, h G C 2 (M) 

Z h ^ L ):= Y, 
where the expected value is given by 



E[Z h (if> L )] =L-~ J hdm*, 



M 

for each of the models considered, consistent with (fTTj) . However the variance 
is of order 

(16) V a r(Z h ^ L ))~^\\A*h\\l 2{mf) 

decaying with L — > oo; here k > is a universal constant, and A* is the 
invariant Laplacian. Note also that here the dependency on the test func- 
tion h is via the L 2 norm of a second order differential operator acting on h 
(namely the invariant Laplacian), whereas in the real valued spherical har- 
monics case it depends on the I? norm of ip itself (i.e. the operator is the 
identity, see (JT2]) and (jlgjl ). 

For h = xu the characteristic function of a smooth domain U C M 
(i.e. Z h is the number of zeros in U), while the expected value of Z h (ip^) 
is still proportional to area(U) ■ L the variance is of different shape (cf. 
Corollary 11.61 in the spherical harmonics case). Namely it is known [12] 
that the variance is asymptotically proportional to ■ len(dll), different 
from Corollary 11.61 both in the power of L and the dependency on the test 
function. This reflects the high frequency oscillations of the zeros smoothed 
out by a smooth test function. 

Shiffman and Zelditch |X8|. [19] considered a more general situation of ran- 
dom independent Gaussian sections s\ = s 1 ' , . . . ,Sf. = s k ' € T(C L ,M.) of 
high powers C L , L — > oo of holomorphic line bundles C on an m-dimensional 
Kahler manifold M, where 1 < k < m. They considered the volume of the 
intersection of the zero sets of S( 

Z u (s 1 , ...8k) = Vol 2m -2fc [(s%, ■ ■ ■ s fe ) _1 (0)) 
and its smooth linear statistics 

Z h ( Sl ,...s k )= J h(z)dVol((s 1 ,...s k )- 1 (0)) 

( Sl ,... Sfc )-i(0) 

(here in case the system is full k = m, the volume is the number of points, 
and the integral is a sum). 

In both cases the expected value is asymptotic to 

E[Z v tf,...4)],E[z h tf t ...8i;j\~cL k , 

where as earlier, c > is proportional to either Vol({7) or the mass of h. For 
the "sharp" random variable they obtained [18] the asymptotic 

VarOZ^f, . . . s L k )) ~ c mk L 2k - m - 1 ' 2 ■ Vol(dU), 
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where c m k are some universal constants, extending Forrester-Honner |12j . 
whereas for the smooth statistics they established a Central Limit Theorem 
with variance 

Var(^( S f,...^))~c,L 2fc - m - 2 , 

where, as in case of Sodin-Tsirelson (|16|) . c/j involves a certain 2nd order 
differential operator acting on h. 

1.7. On the proof of the main results. The proof of the Theorem 11.11 
involves some geometric as well as some probabilistic aspects; we improve 
upon both in comparison with our previous paper. We employ the Kac-Rice 
formula, which reduces the computation of the length variance to the 2- 
point correlation function, given in terms of distribution of the values f n (x) 
as well as their gradients V/ n (x) E T X (S 2 ), for all x £ S 2 . 

Thanks to the isotropicity of the model, it is sufficient to evaluate the 
2-point correlation function only on the arc {0 = 0} (in the usual spherical 
coordinates); this reduces the problem to an essentially 1-dimensional one. 
One then has to identify the spaces T X (S 2 ) via a family of isometries (fr x , 
smooth w.r.t. x, for x on the arc only, which is natural in the spherical 
coordinates. Scaling the arc we find out that for typical x,y S S 2 , the dis- 
tribution of the values and the gradients is a small perturbation of standard 
Gaussian i.i.d random variables N(0,I), the latter recovering the square of 
the expected value of the nodal length to be canceled. We then expand the 
2-point correlation function into a Taylor polynomial around the asymptotic 
one; to do so we use Berry's elegant method. 

It turns out that the long range behaviour of the two-point correlation 
function given is also sufficient to extend the result to continuously differ- 
entiable linear statistics (i.e. Theorem II. 4p . In the course of generalizing 
the proof to include this case we naturally encounter an auxiliary function 
: [0, 7r] — > K. To conclude the proof of Theorem 11.41 we will have to 
understand its behaviour at the origin. 

To prove Theorem 11.51 we apply a standard density argument, approx- 
imating ip with C°° functions, to which we apply Theorem 11.41 To this 
end we use the full strength of the statement of Theorem 11.41 applied to 
<Pi, which enables to uniformly control the error term in (|12|) . For a more 
detailed explanation see section 15.11 

1.8. Plan of the paper. The goal of section [2] is to give a formula for the 
length variance, explicit as possible, starting from the classical Kac-Rice 
formula. In section [3J we use the formula obtained to analyze the variance, 
asymptotically for high energy (i.e. prove Theorem II. ip . In sections U] and 
O we give the proofs for Theorem 11.41 and Theorem 11.51 respectively. 

Appendix [X] will carry on a certain technical computation we will en- 
counter in this paper, namely, that of covariance matrix of a random vector 
involving values and gradients of f n . Appendix [B] will be devoted to the 
Legendre polynomials and some of their basic properties. The goal of Ap- 
pendix O is to give the definition and some properties of the class BV{S 2 ) 
of functions of bounded variation, including their traces on smooth curves. 
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2. An explicit integral formula for the variance 

In this section, culminating in Proposition 12, 7\ we derive an "explicit" 
integral formula for the variance. First we need to introduce the covariance 
function. 

2.1. Covariance function. The covariance function (sometimes also re- 
ferred to as two-point function) is defined as 

I c2| Mi 

(17) u n (x,y) :=E[f n (x)f n (y)] = L> J^^fo). 

■ /Vn fe=i 

It follows from the Kolmogorov theorem [S], that, in principle u n (x,y) de- 
termines the centered Gaussian random field f n , so that one can compute 
any property of /„ in terms of u n and its derivatives. By the addition 
theorem [I], page 456, theorem 9.6.3, u n (x,y) has an explicit expression as 

(18) u n (x, y) = P„(cos d(x, y)), 
where 

P n : [-1, 1] -)■ M 

is the Legendre polynomial of degree n (see e.g. [H]). Recall that d(x,y) is 
the spherical distance so that 

cosd(x,y) = (x,y), 

thinking of S 2 as being embedded into M 3 . 

The orthogonal invariance ([6]) is then equivalent to the corresponding 
property of the covariance function, namely 

(19) u n (Rx, Ry) = Un(x, y) 

for every orthogonal R £ 0(3). In case y is not specified, we take it to be 
the northern pole N 6 5 2 , that is 

(20) u n (x) :=u n (x,N). 

For every t £ [-1, 1], \P n (t)\ < 1 and \P n (t)\ = 1, if and only if t = ±1. 
Therefore 

(21) (u n (x,y) = ±l)^(x = ±y), 
and 

(22) (u n (x)=±l)^(xe{N,S}), 

where iV and S are the northern and the southern poles respectively. 
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2.2. Kac-Rice formulas for moments of length. In this section we ex- 
press the first couple of moments of Z(f n ) via the Kac-Rice formula. The 
most general version due to Bleher-Shiffman-Zelditch [H [5] gives an integral 
expression for all the moments k > 1 of the (m — Z)-dimensional volume of 
{J- = 0} for "generic" smooth vector valued random field 

f = (7i)i<i<j : M ->• M. 1 

defined on an m-dimensional smooth manifold M, 1 < I < m. In our 
previous paper |23j we gave an independent elementary proof for the Kac- 
Rice formula in the particular case of our interest M = S m , J- = f™, 
k = 1,2. 

To present the Kac-Rice formula in our case we will need some notation. 
For x,y G S 2 we define the following random vectors: 

Z? X = (f n (x),Vf n (x)) £lx T X (S 2 ) 

and 

(23) = (fn(x)J n (y)Vf n (x),Vf n (y)) G M 2 x T X (S 2 ) x T y (S 2 ) 

More generally, for k > 1 and x\, . . . , Xf* G S 2 one may define 

k 

Zk = Z n;*i,., Xk eR k x Y[ Txi (S 2 ) 

i=l 

in similar fashion. The vectors Z^ are all centered Gaussian in the sense 
that for every fixed x±,...,Xk G S 2 any linear functional of Z^ is mean zero 
Gaussian. Let 

D'k' Xl, '''' Xk {vi,. .. ,v k , .£ k ) 
be the (mean zero Gaussian) probability density function of z^' Xl '"'' Xk . The 
Kac-Rice formula expresses the k-th moment of Z(f n ) in terms of the dis- 
tributions of Zk only (see Lemma l2.ip . namely DJ • Therefore to express 
the variance (and the expected value) of Z(f n ) we will only need to study 
D?* and L>™ ; *. 

Lemma 2.1 ( [4J Theorem 2.2; [5] Theorem 4.3; [23] Proposition 3.3). The 
first two moments of the nodal length of the spherical harmonics are given 
by the following formulas. 

(1) Expectation: 

(24) E [£(/»»)] = / Pn(x)dx, 

s 2 

where the density of the zero set P n (x) is given by 
Pn{x)= f U\\DT' x (0,Odli- 

(2) Second moment: 



(25) 



E[Z(f n ) 2 ] = // K n (x,y)dxdy, 
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where the 2-point correlation function K n is given by 

(26) 

k n (x, y ) = - u 1 . = / f urii • \\e\\D2(o,o,e,en x de. 



2-Ky/l - u n (x,y) 2 

T x (S 2 )xT y (S 2 ) 

Neuheisel [15] (see also [23]) noticed that for every x S 5 2 , under any 
isometry T x (S m ) = R 2 (i.e. any choice of orthonormal basis of T X (S 2 )), 
the distribution of Z™' x is mean zero Gaussian with the diagonal covariance 
matrix 

'\ 

where 1% is the 2x2 identity matrix. It is then clear that P n is x-independent 
(this also follows from the rotational independence), and 

Pn{x) ' 



/2vr 

by a standard computation. This, together with (|24p . yields ([8]) for m = 2 
and finishes the treatment of the expectation in this casej. Moreover, slightly 
modifying the proof of (|24p . we obtain 



and (fTT]) follows. 



E[Z*(/ n )] = / ip(x)P n (x)dx, 
s 2 



The goal of the remaining part of the present section, culminating with 
Proposition 12.71 is to make the formula ([25]) for the second moment "ex- 
plicit" and suitable for asymptotic analysis. The rotational invariance (0) 
of our model implies that K n depends only on the spherical distance d(x, y) 
between x and y i.e. (with a slight abuse of notations) 

(27) K n (x,y) = K n (d(x,y)), 
which will be used later. 

Remark 2.2. One may define K n (x,y) intrinsically as 
Rn{ - X ^ = ^ h 1 < ^ E [ll V ^)H • l|V/(y)|||/(x) = f(y) = 0] , 

the expectation being one of the product of gradients conditioned on / 
vanishing at x and y. 

Remark 2.3. It is important to note that the symmetry of the nodal lines 
w.r.t. the involution x t— > —x (see Remark II. 3 p implies that 

(28) K n (x,y) = K n (x,-y). 



The main disadvantage of the formula (|26p is that one has to work with 
probability densities defined on the tangent planes T X {S 2 ) which depend on 
the point x £ S 2 . In principle one may consider the tangent planes being 
embedded in M 3 . This, however, is highly inadvisable since that would result 



3 The same computation gives the result for every m > 2 
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in working with singular Gaussians supported on a plane corresponding to 
T X (S 2 ). It is thus desired to identify for every x G S 2 

T X (S 2 ) ^ M 2 

via an isometry, i.e. fix an orthonormal basis B x varying smoothly for x G S 2 
(i.e. an orthonormal frame). Unfortunately it is impossible to choose a 
global orthonormal frame on S 2 ; however one can still get around that by 
noting that in fact all we need is a local choice for given x, y G S 2 . 

In general, the orthonormal frame chosen will affect the probability den- 
sity function D^'* of '* induced on M 6 (though D™'* will stay invariant); 
in section 12.31 we show how to compute '* for a given choice of local or- 
thonormal frames. In section |2~4"1 we will show how to choose the orthonor- 
mal frames to simplify the computations; we will use this construction while 
evaluating the two-point correlation function (|26p . 



we con- 



2.3. Kac-Rice formula in coordinate system. Given x,y G S 2 
sider two local orthonormal frames F x (z) = {efjef} and F y {z) = {e^e^}, 
defined in some neighbourhood of x and y respectively. This gives rise to 
(local) identifications 

(29) T X (S 2 ) =R 2 = T y (S 2 ), 



which are isometries. 

Under the identification (|29p the random vector (j23[) is a 
Gaussian with covariance matrix 



mean zero 



T,(x,y) 



where 
(30) 



-42x2 = A n (x,y) 



A 



1 



B 

C 



u n (x,y) 



u n (x,y) 

1 



(31) 

and 
(32) 



#2x4 = B n (x,y) 





V x u n {x,y) 



C 4X 4 = C n (x,y) 



H 



V y u n (x,y) 




H 



t E r , 



h 



with "pseudo-Hessian" 

(33) H 2t2 (x, y) = (V x (8) V^) u n (x, y), 

i.e. H = (hjk)j,k=i,2 with entries given by 

d 2 



h 



u n (x,y). 



The covariance matrix of the Gaussian distribution of Z 2 in 
upon f n (x) = f n (y) = is given b}0 

(34) n n (x, y) = C - B t A~ 1 B. 



conditioned 



This is the inverse of the lower right corner of E 1 
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We then have a frame-dependent formula for the two-point correlation 
function (l26l) 



K n (x,y)= 1 = // HHI • IKI|x 

y/l - u n {x,y) z 



(I ! dwidw 2 

x exp I --(w 1 ,w 2 )Un{x,y) (w 1 ,w 2/ 



(2vr) 3 v / det^ ri (x,y)' 



where Q n (x,y) is given by ([3 



Remark 2.4. Note that even though K n is rotational invariant (i.e. K n (x, y) 
depends only on the spherical distance d(x, y)), the same is, in general, false 
for the covariance matrices T, n (and $7 n ). 

Let 4>, 9 be the standard spherical coordinates on S 2 . Using the rotational 
invariance (|27p of the 2-point correlation function we obtain 



E[Z(f n ) 2 ] = JJ K n (x,y)dxdy = \S 2 \ J K n (N,x)dx 
S 2 xS 2 S 2 

= 2tt|5 2 | J K n (N, x((p))sm <pd(j), 
o 

where x(<p) £ S 2 is the point corresponding to the spherical coordinates 
(<j), 6 = 0). Note that K(N, x(f)) = K(x, y) for any x, y £ S 2 with d(x, y) = 
<f>. We therefore have the following corollary. 

Corollary 2.5. One has 

(35) E [Z(f n ) 2 ] = 2ir\S 2 \ f £ n (^)sin^#, 

o 

where 

K n (4)) = K n (x,y), 
x,y £ S 2 being any pair of points with d(x,y) = 4>. 

The main goal of the present paper is to understand the asymptotic be- 
haviour of the function K n ((j)). To this end we will have to provide a more 
explicit formula for K n by choosing concrete orthonormal frames in section 
12.41 It also turns out that it is more natural to scale the parameter by 
essentially n; this will be done in section [231 



2.4. Choosing orthonormal frames. Corollary 12.51 implies that it is suf- 
ficient to provide a choice x,y G S 2 with d(x, y) = 4> for any given <f> £ (0, 7r), 
and for the choice made, provide local frames around x and y. Hence we 
may restrict ourselves only to points on the half circular arc 

NS = {6 = 0}. 

Let 

(36) F = \e 1 = ^-,e 2 ' 



sin d> d6 
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be the orthonormal frame defined on S 2 \ {N, S}. 

Given <p G (0, tt) we choose any pair of points x, y G NS \ {N, S} with 
d(x,y) = (f) and set F x := F and F y := F locally in the neighbourhood of 
x and y respectively. An explicit computation shows that in this case the 
covariance matrix T, n (x, y) depends only on eft rather than on x, y, and, thus 
so does £l n ((f)) = Q n (x,y) of our interest. We compute in Appendix lAl the 
conditional distribution covariance matrix explicitly to be 



(37) 



o 

b 






En 
2 





En 



+ a 



c 


En I 

1 ' 



1 — P n (cos < 



whose entries are given by 
(38) a = a n {4>) = - 

(39) 

b = b n ((f>) = P^(cos 4>) cos cf>— P^'(cos (f))(sm<f>) 2 + 

and 
(40) 

We then have 

K n {4>) = 



P^(cos (/>) 2 (sin ( 



Pn (COS. 



1 — P n (cos ( 



•P^(cos 4>) 2 (sm ( 



Cn{4>) = -P^(C0S( 



V 1 - < x f 



\Wl 



top 



x exp 



--(tui,w 2 )fi n (0) ^toi,^) 



dwtdwz 



(27r)Vdetn n (^)- 



with the covariance matrix fi n given by ()37p . 



Remark 2.6. We choose to work with points on the arc {6 = 0} since 
here the covariance matrix Cl n (cp) is relatively simple. This corresponds to 
Berry's [3] choice of points on the x axis while dealing with random waves on 
M 2 , which takes advantage of the fact that for two points x, y G R 2 on the x 



axis the canonical orthonormal bases for T r 



and T„ 



coincide under 



the natural identification T X (M. 2 ) = T y (M?). Rather than working with the 
canonical bases, one may of course choose to work with such orthonormal 
bases for any two points on the plane; this approach results in the same 
computation as on the x axis. 

2.5. Scaling the integral formula. As pointed earlier, the two-point cor- 
relation function K n is expressible in terms of the covariance function u n 
and a couple of its derivatives, which in turn are expressible in terms of 
degree n Legendre polynomial and its derivatives. The high energy asymp- 
totics n — } oo of K is then intimately related to the behaviour of P n (cosd) 
for large n. It is known from the Hilb's asymptotics (see Appendix [B]) that 



P n (cOS( 



sin ( 



-J o (0(n + l/2)). 
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It is thus only natural to introduce a new parameter ip related to </> by 

9 = —, 
m 

where from this point and throughout the rest of the paper we denote 
(41) m := n + — . 

We will rewrite the formula (|35j) in terms of ij) rather than (f>, in hope to 
simplify the subsequent computations. 

Proposition 2.7. The variance of the nodal length is given by 



(42) 
where 

(43) 



Var(Z(/ n )) = 4vr 2 



71 + 1/2 



In ; 



o 



Kn{ip) - | ) sin(i/!/m)#, 



the scaled two-point correlation function 

1 , ,, 



(44) lJj:J 



101 • \\W2\\ X 



y 7 ! — u(x) 2 
x exp ^-^(wi,w 2 )A n ^y 1 (wi,w 2 y 



dz\dz2 



(27r)VdetA n (^) 



mf/i scaled covariance matrix 
0(^/m) 



(45) 



£?n/2 



/l + 2a 26 0\ 

1 2c 

26 l + 2a 

\ 2c 1 / 



whose entries are explicitly given by 
a = a n (V>) = —a n (ip/m) 

ti n 



(46) 



1 



1 



— o • PL (cos(ip / m)) 2 sin(-0/m) 2 , 

E n 1 - P n (cos^/m)) 2 nV V / " V / 7 



1 ~ 1 

6 = b n (tp) = —b n (ip/m) = — 



E n E n 



P' n (cos(ip/m)) cos(VV m ) 



(47) 

and 
(48) 



+ 



P% (cos(ip /m)) sm(t/j/m) 2 
P n (cos(^/m)) 



1 - P n (cos(i> /m)Y 



■ P' n (cos(ip/m)) sva(ip/mY 



c = Cn(ip) = —cn((if>/m)) = -^-P' n (cos(^/m)) 

tin tin 



Remark 2.8. Using the Cauchy-Schwartz inequality one can easily check 
that |6 n (V0|, l c n(VOI — §■ The inequality |a n (^)| < \ is obvious. 
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Remark 2.9. We rewrite ([28]) as 

(49) K n (<j>) = K n (ir - <t>). 

Remark 2.10. One can express K n (ip) in probabilistic language as 

K n(lp) = 1 E [\\U\\ • \\V\\] , 

(2n)y/l -P n (cos WM 2 

where (U, V) are mean zero Gaussian random variables with covariance ma- 
trix A n (ip). We will find this expression useful later, when we will study the 
asymptotic behaviour of K n for large ifi (see Proposition 13. 5p . 

3. ASYMPTOTICS FOR THE VARIANCE 

In this section we establish the asymptotics for the variance of the nodal 
length i.e. prove Theorem ll.il Recall that the variance of the nodal length is 
given by (|42|) . Thus Theorem II .11 is equivalent to the following Proposition. 

Proposition 3.1. As n — > oo one has 

t - 65 lQ g n , n ( - 
n "l28vr 2 n \n 

The rest of the present section is dedicated to the proof of Proposition 
3.1. Asymptotics for I n . Recall the definition 

Km 

I n = J (^K n {^) ~ sin (i>/m) dip 
o 

of I n , where K n is given by flH|) or, equivalent ly, ([6T]l . and m is related to 
n via (gU). 

We note that the scaled version of (l4"9l) is 



(50) K n (ij>) = K n (Trm-iP). 
Thus, by the definition (|4"3"j) of I n and ([50]) . we have 

irm/2 

(51) I n = 2- j ftf n (VO-^sin(^/m)#. 

o 

Therefore we may concentrate ourselves on [0, mn/2] rather than the full 
interval. 

Ideally, to evaluate I n , one would hope to obtain an explicit formula for 
K n (ijj). Unfortunately, to the best knowledge of the author of this paper, 
no such formula exists. However we will still be able to give an asymptotic 
expression for K n (ip) for large values of ip uniformly w.r.t. n and tp. 

For small values of ip the behaviour of K n is very different, due to the fact 
that as ip — > 0+, P n (cos(ip / m)) approaches 1, which results in the singularity 
of ; 1 and the of the covariance matrix A„(ib) at the origin. 

Nevertheless, we will see that this "singular" contribution is negligible, so 
that a relatively soft upper bound already obtained in [23] will suffice (see 
Lemma 13.21) . 
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More precisely, we choose a constant C > 0, which is kept fixed through- 
out the rest of the computations, and write 

Trm/2 C Trm/2 

(52) 

c 

The main contribution to the integral will come from the second ( "nonsingu- 
lar" ) integral in ([52]) i.e. outside the origin. Our first task is then to bound 
the first ("singular") integral of (f52|) . A satisfactory bound was already 
given in [23J. 

Lemma 3.2 (Restatement of Lemma 4.2 from [23J). For any constant 
C > we have as n — )• oo 
c 

K n (tp) — — sin(ip / m)dip = O 

o 

Lemma 13.21 together with (|52p and (|5ip yield the following lemma. 
Lemma 3.3. For any choice of the constant C > we have as n — > oo 

(53) I n = 2I n + O l ' 



/ 



where 

■Km i "1 

(54) I n = J \K n (1>)-^sm{1>/m)di/>. 

c 

Therefore, to understand the asymptotic behaviour of I n it is sufficient 
to understand the asymptotic behavior of I n . Proposition 13.41 resolves the 
latter. The proof of Proposition 13.41 is given throughout the rest of the 
present section. 

Proposition 3.4. For any choice of the constant C > in the definition 
(j54"|) of I n , we have as n — > oo 

/ _ _ \ f 65 log n ( \ 

(55) / " = 2W^- + n 



Proof of Proposition \3.1\ assuming Proposition \3.4\ Just use Proposition^ 
together with (j53|) . 



□ 

3.2. Asymptotics for the 2-point correlation function. Recall that 
K n (ip) is given by ()44p . One may notice that 

KnW = = = F(a„W,t„W,^W), 

27IV1 - P n {cos(ip/m)) A 

where F(a, /3, 7) is a smooth function independent of n, defined on some 
neighbourhood of the origin (a, ft, 7) = (0,0,0). Its arguments a n (ip), b n (ip) 
and c n {ip) are uniformly small for tp > C (see Lemma |3.9|) . 



Note that in 23 , Lemma 4.2 was given without scaling i.e. in terms of rather that 
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An easy explicit computation shows that 

F(0,0,0) = i 

which cancels out with the constant term in (|43[) which corresponds to 
(KZ) 2 . This is not a coincidence, since the origin a = j3 = 7 = cor- 
responds to the covariance matrix A n being the identity matrix A n = I n ; 
in this case the probability density function factors. 

We choose to expand F(a, /3, 7) into a finite Taylor polynomial around 
the origin^. We note that the matrix elements are of different order or decay 
rate, so that we may cut the smaller terms earlier than the larger ones. 
The decay rate of a n , b n and c n prescribed by Lemma [3^91 implies that it is 
sufficient to expand a n , b n and c n up to 2nd, 4th and 1st degrees respectively. 

The following is the Key Proposition for the whole paper. We will reuse 
it while proving Theorem 11.41 (see section 2]) for smooth linear statistics of 
the nodal line, (see also Remark [37! 



Proposition 3.5 (Key Proposition). For any choice of C > 0, as n — )• 00, 

one has 

(56) 

1 , 1 sin(2yQ 65 1 ,9 cos(2^) 

%W = 7 + ~ ■ / , / — \ + 777T - 5 -tii — 77 + 



4 2 nn sm(ifj / m) 256 -K 2 ns\n{ip /m)^p 32 imip sm.{ip/m) 



n 2 nip sm(ijj/m) ntp 
uniformly for C < ifi < irm/2. 

Remark 3.6. It is important to notice that the leading nonconstant term 
h Jlflul^s i s oscillating, and we will see that it does not contribute to 

£ 7T T L j 1 11 1 Ki) I Tib J 

the variance (see the proof of Proposition 13. ip . We observe this obscure 
"Berry's cancelation phenomenon", which is responsible for the variance 
being surprisingly small, in some other situations, such as Berry's original 
work |3j , and on the torus |14j . This suggests that this phenomenon is of a 
more general nature, and we expect it to occur on a "generic" surface [22J. 



Remark 3.7. As another application of Proposition 13. 5\ one may exploit it 
to study the morphology of the nodal lines for n-dependent linear statistics 
<p = <p n - It is most efficient for ip n whose support is not shrinking too rapidly 
(relatively to the scaling ip ~ ncj) we introduced earlier), for example ip n a 
characteristic function of a spherical disc of radius a n where 



n — v 00. 



For finer "local" statistics of the nodal lines, such as studying the nodal 
length inside a spherical disc of radius w ^ one needs to expand K n (ip) 
around the origin, where the behaviour is very different from the one at 
infinity. We may want to do so in the future. 



^Intuitively, the origin a = b — c = corresponds to xjj " = " 00 (see the decay at infinity 
in Lemma l3.9p . hence this expansion should be good for large values of tp. 
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The asymptotic evaluation (|56p in Proposition 13.51 is done in two steps. 
Lemma [3 . 81 provides an approximation for the two-point correlation function 
K n (ijj) with a polynomial in P n (cos(ip/m)), a n (ip), b n (ip) and c n (ip) i.e. 
the Taylor expansion of K n as a function of the above expressions. In 
the second step, performed in Lemma 13.91 we evaluate each of the terms 
appearing in the Taylor expansion obtained in the first step using the high 
degree asymptotics of the Legendre polynomials and its derivatives (Hilb 
asymptotics). Lemmas 13.81 and 13.91 are proved in sections 13.4.11 and 13.4.21 
respectively. 

Lemma 3.8. For C > large enough, one has the following expansion on 
[C,irm/2] 

(57) 

K n{i>) = 4 + 4 • a nW + g • ~ 7^ " ^nO) 2 ~ T£ ' a n {f)b n {^) 2 

+ • bnW 4 + I ■ Pn(cos(^/m)) 2 + - • a n (^)P„(cos(V/m)) 2 

+ -L • & n (V0 2 P„(cos(</>/m)) 2 + Jr • P n (cos(0/m)) 4 
lb 62 

+ O (P n (cOs(V/m)) 6 + an(V0 3 + bnW 5 + Cn(V>) 2 ) , 

where the constants involved in the "O "-notation depend only on C. 

Lemma 3.9. For n > 1, C < tp < irm/2 we have the following estimates 
for a,b,c and P n (cos(ip /m)) . 

(1) 



2 l + sin(2^) cos (2^) _ / 1 1 



(58) 



PJcosiip/m)) 2 = K YJ y ^' + — + — 

7rnsin(-^/m) Aimij) sm(ip/m) \V 3 V ;ri 



1 + sin(2^) Q ( 1 



7rnsin(^/m) \ip 



(2) 



a^Uh) = 1 ~ sin W + 3cos(2VQ _ 1 + cos(4yQ +Q (J_ + J_ 
" ' nn sin(ip / m) Aim 2 sin(ip/m) 2 2i: 2 n 2 sm(ip / m) 2 \V 3 flip 



7rnsin(^/m) \ip 



(3) 



= 1 + sin ( 2 V0 + 7cos(2V») + 1 + cos(4y>) + Q / J_ + J_ 
7rnsin(^/m) 47rnsin( , (/>/m)?/; 7r 2 nsin('i/>/ m )V' VV' 3 n V ; 



(4) 
(5) 



mvoi = o 



^3/2 



p, / / / ^4 |+2sin(2V;)-icos(4V;) (\ 
P n cos^/m = . + O -3 
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(6) 



(7) 
(8) 



(9) 



f-2sin(2T/;)-lco S (4yO (\ 
a n {ip) = 2 2 ■ m / ^2 + ° 73 



= f + 2 S m(2^)-ico S (#) + o / J_ 
7r 2 n 2 sin(^/m) 2 \V 3 



r> /■ / / / \\2 ( i \ 1 + C0S(4^) , „ / 1 

P„(cos(^/m)) a n {tp) = - . + ° 73 



o 2 _ 3/2 + 2sm(2^) - |cos(4^) ^ / 1 



P n (cos(W™)) X(V0 = 2 2 \ ' ,\ 2 +O -3 



(10) 



. 2 _ 1 + cos(4V0 , „ ( 1 



2TT z n z sva.[%p/m) z 



Proof of Proposition V3. 51 Substituting all the various estimates in Lemma 

into (|57|) we obtain, after collecting similar terms together and some 

i bv i 

n 2 sin(i/ , /rrt) 2 " nsm{%j}/m)%j} 



reorganization (replacing J,,.^ by rasin ,l /mWj whenever necessary) 



^n(V>) = \ + .),, , (\ ■ sm(2^) + \ sin(2V) + \ sm(2^) 

4 irnsm(ip/m) \4 8 8 

1 /111 133133 

+ 7r 2 nsm(^/m)^ \ 4 ' 2 + 8 ' ~ 32 ' 2 + 16 ' 2 + 128 ' 2 

111 13 33 
+ -- 0- -- - + — •- + — • - 



+ 





1 


:(- 


4 


1 


3 


16 ' 


2 




3 


G- 


4 




1 




4 



1 ^13 ,..17 ,..11 

+ 



nnijjsin(ijj/m) \4 4 8 4 8 4 

1 /ll ,. .. 1 1 , , 1 



-K 2 nipsm{ip/m) \ 4 2 v r/ 8 v ' 32 v v ry 2 



3 1 3 1 11 

+ — • -cos(4t/>) + — (2sin(2^) - -cos(40)) - - • -cos(4^) 

+ ^(2sin(2^) - ^cos(4V0) + ^ ■ (2sin(2V) - ~cos(4V))J + O + 
_ 1 + 1 sin(2V>) + 65 1 + 3 cos(2V>) 



4 2 7rn sm(ip/m) 256 7r 2 n sinty /mjifi 8 7rn'0 sin(ip/m) 
n(2^)- 2 ^cos(4^) / J | 1 \ 
Tr 2 nip sin(ip/m) nVv 



□ 



3.3. Concluding the proof of Proposition 13.41 All the hard work es- 
tablishing the asymptotics (|56p of K n (vp) at infinity finally pays off as the 
proof of Proposition 13.41 is now straightforward. 
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Proof of Proposition \3-4\ Recall that I n is given by (|54|) , where K n {ip) for 
large ip we may asymptotically expand K n {ip) as (f5"6"j) . First note that the 
constant term i in (I56D cancels out in (I54I). Thus we have 



(59) 



■Km/2 



O 



+ 



1 sin(2V>) _65 1 9 cos(2^) 

2 7rnsin( , 0/m) 256 7r 2 nsin(^/>/m)^ 32 nnip sm(ip/m 

/ 7rm/2 

sin{ip/rn)dtp + O 



|7 sin(2V;) _ n cos(4V)) 



7r 2 nip sm(ip/m) 



1 1 

■0 3 



\ 



sin( , /m)dip 



J 



1 

7rn 



Km/ 2 



C 
1 



1 . ,, 65 1 9 cos 2^) 
- s n(2V>) + + - 1 + — 

2 v ^ J 256 vr^ 32 V 



27 sin(2V) - ^ cos(4V) 



nip 



dip 



+ Ol- 

The contribution of the first term in (|59p is bounded by 

Km/2 



1 



n 



sin(2ip)dip = O 



n 



c 



The main contribution to (j59j) comes from the leading non-oscillatory 
term i.e. the second term: 



(60) 



05 



■Km/2 



2567r 2 n 



o 



ahp_ 



65 logn „ ( \ 
-2— + O 



256vr 



n 



n 



Bounding the contribution of the other oscillatory terms using integration 
by parts, as well as bounding the error term in (|59p . is easy. The asymptotic 
expression (j60|) together with the bound for the contribution of the other 
terms in (foUj) yields the result ([53]) of the present proposition. 

□ 



3.4. Proofs of auxiliary lemmas. 



3.4.1. Taylor expansion for K n (ip). In principle, one may compute the coef- 
ficients of the multivariate Taylor expansion directly using the Leibnitz rule 
for differentiating under the integral sign. The following elegant method 
due to Berry [3] gives the necessary Taylor coefficients avoiding the long 
and tedious computations. 

Proof of Lemma \3.8l We use Remark I2.10I to write 

(61) K n (iP) = 1 — E [\\U\\\\V\\] , 

(27r)yl - P„(cos ip /my 

where (U, V) is a mean zero multivariate Gaussian random vector with co- 
variance matrix A = A n (ip) given by (|45p . On [C, irm/2] we may Taylor 
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expand the first term in (|6ip as 
(62) 

1 1 



y/l - P„(cosV'/m) 5 



l+-P n (cos ip/m) 2 +-P n (cos ifj/mf+O (P n (cos i^/mf) 



since by part [T] of Lemma 13,91 |-fn(cos(^/m))| is bounded away from 1, 
provided that C is large enough. It then remains to expand the remaining 
part of (f6T|) i.e. E [\\U\\ \\V\\] in terms of powers of a, b and c. 
To this end we use the identity 



1 



a 



1 — e 2 



which implies 

E[||£/||||F||]=E 



(63) 



1 

2^ 



1 

2^ 



o jo 

oo roo 



t\\u\\ 
1 — e 2 



1 — e 2 



o JO 



[/(0,0)-/(i,0)-/(0, S ) + /(M)] 



dtds 

(ts) 3 / 2 

dtds 



{tsfl 2 ' 



where we define 
(64) 

/(M) = / A "W(M) :=E 



tiiu-ir+ S |ivii 



(2vr) 2 (detA ri (V)) 1 /2 



n i |\W- 1 +r 2 



s/ 2 



(detA n (V)) 1 /2det A n (^)-i + 



t/ 2 



a/ 2 



1/2 



Let 



where 



det J + 



th 



sh 



A„tyO 



1/2- 



2a 







; S = SnWO 



26 



2c/ ' 



with entries defined by (06]), (|4"T|) and (ji5|) . Thus we have 



/ + 



s/ 



A n ty>) 



(l + i)I + L4 t£ 

s.B (l + s)/ + s^ 
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so that 



del | / - \ th ^ ) A 



det ((1 + t)I + tA) det ((1 + s)I + sA- stB{{! + t)I + tAy 1 B) 

t 

s st „ / „ t x 1 



( 65 ) = (1 + *)*(! + ,)* det (l + A* 



xdet / + A-- - -B I / + A B , 

I 1 + s (l + s)(l + t) V 1 + * 



where we make use of the fact that both A and B are diagonal and hence 
commute. 

We compute the first determinant explicitly as 
(66) det f 1 + — ^— .4 | = 1 + — —2a. 

Next we wish to compute the other determinant in (|65p . For this we write 



where we understand the "O" -notation entry- wise. Therefore (taking ad- 
vantage of the fact that all the matrices involved are diagonal) , we have 

I+^-A--, 5 -B 2 (l + —A~' 



1 + s (l + a)(l + t) V 1 + t 

I + — - — A - - g -B 2 + 7 S S ^AB 2 + O (a 2 b 2 ) 

1 + s (l + s)(l + t) (l + s)(l + i) 2 1 j 

/l _l _2s_ n _ 4st l2 I 8st 2 .2 

1 ^ (l+ S )(l+i) + (l+ S )(l+t)^ a0 



+ O {a 2 b 2 ) 



V 1 (i+s)(i+*) c , 

Therefore, we have 

detf J + ^-A-t- -5 2 (/ + -^4' 

^ 1 + s (l + a)(l + t) V 1 + * 

2s 4st 2 8si 2 2 \ / 4st 

+ r+7 a " (i + s)(i+o 6 + (i + s)(i+o 2 y 1 mm c 



+ 0{(a 2 b 2 + c 2 )) 



1 + ; a - 7z rr-, sb + t-, rr, ^ ab + ( a b + c • 

1 + S (l + s)(l + t) (l + s)(l+t) 2 
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Substituting (j66|) and (|6"7|) into (j65|) we obtain 
(67) 

det ( /+ C /2 ,/ 2 ) A )^ 1 + ^ 1 + s ) 2 ( 1 + if7 a ) x 

' ! 1 + T^a - — + wf ,o «6 2 + (a 2 6 2 + c 2 ^ 



1 + s (l + «)(l + t) (l + s)(l + i) 2 
= (l + i) 2 (l + s) 2 x 

/ 2{t + s + 2st) 4st , 2 , 2 , „/ 2 , 2 2 \^\ 



Now we use the expansion 



1 - 1 _ * + + 0(a , 3) 



to write 
(68) 



(t + s + 2st) 2st l2 / 3s 2 3t 2 st 



(l + fl)(l + t) (l + «)(l + t) \2(l + s) 2 2(1 + t) 2 (l + a)(l + t) 
6si , „ 6t 2 s 2 



: (t + s + 2st)ab' 2 + ^-f vi bA + °( a3 + b 5 + c 2 ' 



(l+t) 2 (l + s) 2V ' (l + i) 2 (l + s) 2 

The asymptotic expansion (|68p implies, in particular, 
(69) / A "W(0,0) = 1, 



(70) /AnW (t)0) = _l_^ 



1 + i 2(1 + t) s 



and 



(71) /A.«>(„, .) = ^_ (i - ^ + + 0(o * + * + 

Define 
(72) 

F A "W%,s) := / A "W(0,0) - / A «^(t,0) - / A " W (0,s) + / A " W (t,s), 
so that in the new notations (|63|) is 

(73) n\\U\\\\V\\] = 4- / F A »W(M)- 



2Wo 7o V ' '{isf/i- 
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Plugging the estimates (|68j) . (|69j) . f[T0 j) and ([71]) into the definition (J72J) of 
F A„(V) y i e ids 

(74) 

^' aJ -(l + t)(l + a ) + (l + t)2(l + a )2 

2s4 2 ts (t + 10ts + s + 3is 2 + 3 * 2 s - 2 ) 2 

+ (l + S ) 2 (l + 4) 2 2(l + t)3(l + S )3 ° 

(4 + s + 2s 4) • ab 2 + - f* 2 .f .„ • b A + 0(a 3 + b 5 + c 2 



(l + t)3(l + a )3V "V (l+*)3(l + s )3 

where the constants involved in the "O" -notation are universal. We wish 
to plug (|74j) into (|73|) and integrate with respect to 4 and s. The problem 

oo 

is that the integral J diverges at the origin so that the bound for the 
o 

error term in ([74|) is not sufficient. To resolve this isssue we notice that (J72J) 
implies that we have 

(75) F A " W (M)|t=o = F A "W(t )S )| s=0 = 0. 

and identify the expression (|74p as the Taylor expansion of F^ n ^\t,s) 
considered as a function of (a, 6, c) with fixed parameters 4, s around the 
origin (a,b,c) = (0,0,0). The vanishing property ([75]) implies that all the 
Taylor coefficients in the expansion (|74p considered as a function of 4, s, are 
divisible by 4s, so that we may improve the error term in (|74p as 

(76) 

F A„ W(M)= ts | s4(2 + 4 + s) 



(l + 4)(l + s) (l + 4) 2 (l + s) 2 

2s4 2 4s(4 + lOts + s + 34s 2 + 34 2 s - 2) 2 

+ (l + s )2( 1 + t )2 ' b 2(1+4)3(1 + 5)3 ° 

6s4 , , o 64 2 s 2 ,/ 

(4 + s + 2s4) • a& 2 + • 6 4 



(l+t) 3 (l+ S f ' (1 + 4)3(1 + S 

+ 0(m(4,s)-(a 3 + 6 5 + c 2 )), 
where we introduce the notations m(t) := min{4, 1} and 

m(t, s) := m(t) ■ m(s). 
Plugging (|76p into (|73p and integrating term by term we obtain 
(77) 

E[||Lr||||F||] = | + |.a + |-6 2 -^-a 2 -^.a6 2 + ^ 4 + 0(a 3 + 6 5 + c 2 ), 
where we used the standard integrals 

oo oo oo 

f dt - f dt _ 7r /'^J_ 7r 
J Vt(i + t) _7r ' J Vt(i + tr~r J JTTW 2' 



oo oo oo 

y (iTf^ - T' y (i+f - s ' y v*(i+«) 3 ~ 
000 
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We finally plug the estimates ([62]) and ([7?]) into ([6T|) to obtain ([5?]) . that 
is the statement of the present Lemma. □ 

3.4.2. Some estimates related to the matrix elements. In this section we 
evaluate the various expressions appearing in ([5?]) asymptotically as ip — > oo, 
namely prove Lemma 13.91 To evaluate the matrix elements a, b, c we will 
need to deal with the asymptotic behaviour of the Legendre polynomials of 
high degree. The reader may find the necessary background on the Legendre 
polynomials as well as some basic asymptotic estimates in Appendix [B] (see 
Lemma IB.3|) . 

Proof of Lemma \3. M It is easy to check that parts IMTUl of the present lemma 
follow directly from parts HB Moreover, part [T] may be obtained by a 
straightforward application of (|116p . and part0]is a direct consequence of the 
high degree asymptotics (|1 17|) for the derivatives of Legendre polynomials. 
It then remains to prove parts [2][3j 

Recall they we assume that C < ip < 7rm/2, so that P n (cos(ip/m)) is 
bounded away from 1 by Hilb's asymptotics (j!15|) . Hence we may write 

(78) 

a n (ip) = 7T P' (cos(ip / m)) 2 sm(ip/m) 2 ^Picosiip lm)) 2 P\cos(ip lm)) 2 sm(ip/m) 2 

^ [nip ^ ?/> 3 ) ' 
where to bound the error term we used the decay 

(79) \P'(cos(ip/m))\ = O 1 1 

which follows from (|117|) . 
Now we use (|117p to obtain 

(80) 



^,3/2 J ' 



-rrP' (cos( ib/ m)) 2 sm(ip/m) 2 = . . , . — r-r ( smiip/m) 2 sin (ip — ^) 

n 2 irnsm(ip/m) 3 \ V 4/ 



sinty/m) cos(2^) + O ( \) )+ O (± + ± 
on \n z J J \ip° nip 



l-sin(2^) 3cos(2^) + Q / 1 + 1 



7rnsin(^/m) Aim 2 sin(ip/m) 2 \-0 3 nip J 
and ([58]) together with ([80]) imply 
(81) 

^P n (cos(^/m)) 2 ^(cos(^/m)) 2 sin(^/m) 2 = - \ ± C ° s(4 f +o(^ + —\ 
n z Lit z n L svo.[ip / my yip nip J 

Substituting (|80p and (|8ip into (|78p we obtain part [2] of the present lemma. 

It then remains to prove part [3] of the lemma i.e. establish a two-term 
asymptotics for b n {ip) 2 . To achieve that we first evaluate b n {ip). From the 
definition (|4"7|) of b n {ip) we have, using (f79|) to replace E n = n(n + 1) by n 2 
and 

/^ 2N 

cos(ip/m) = 1 + O [ — 
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to replace cos(ip/m) by 1 

bni^) = ^Pl l (cos(i/j/m))cos('i{j/m) ^ P" ( coshp lm)) sin(Wm) 2 

+ ^2 P n(cos^/m))P' ll (cos(^/m)) 2 sin^/m) 2 + O ^7^7^ + 



— jr-Pl (cos(ip/m)) 7; P'J, (cosN) I 'm)) sin(Wm) 2 



1 /ll 

+ -^^(cos^/m))^^^/^)) 2 sin^/m) 2 + O (775/2 + J ■ 

Next we use the differential equation (|113p satisfied by the Legendre poly- 
nomials to write 

b n (ip) = P n (cos(VyW)) - —^P' n (cos(ip/m)) 



+ \p n {cos(i>/m))P' n (cos{i)/m)) 2 sin^/m) 2 + O ( —rjx + ' 



^ m{r + ^ Icos^ + f^ ,/2 sin(^-f) 



imshi(ip/m) y 4 8 ip J V vr re 3 / 2 sm(i[)/m) 3 / 2 



2 . . ( vr. l-sin(2^) ^ / 1 1 



Ttnsm.{ip/rn) 4 7rnsin(^/m) \ip 5 / 2 ny/ip J ' 

where we used (|116p . (|117|) and reused ([80]) once more to obtain the second 
equality. Reorganizing the terms in the last expression, we have 



■jmsmty/m) 4 8 y nn sin(ijj /m) ip 

| ygj (sin(y; + f) + lcos(3^ + f)-^cos0/;-f)) ]Q ( ^ ] 1 



7r 3 / 2 n 3 / 2 sin(^/m) 3 / 2 \^ 5 / 2 n\A/> 

" . r . , 7T 7 / 2 cos(^ + I ) 



y 7rnsin(^/m) 4 8 y 7rnsin(^/m) ^ 

| ( s in(V> + f)+cos(3^ + f)) | / 1 | 1 



V^vr 3 / 2 ?! 3 / 2 sin('0/m) 3 / 2 V^ 5 ^ 2 riy/tf J ' 

and we obtain part [3] of the present lemma by squaring the last equality. 



□ 



4. Proof of Theorem 11.41 

In this section we assume that ip : S 2 — > M. is a continuously differentiable 
even function. For the sake of proving Theorem 11.51 we will conduct the 
analysis of the error terms in terms of the L°° norm ||^||oo and the total 
variation V{(p) of the test function, as prescribed by Theorem 11.41 

Our first goal is to formulate an analogue of Proposition 12.71 for the vari- 
ance of Z ip {f n ). It turns out that a certain auxiliary function defined 
below comes out from a straightforward repetition of the steps we performed 
in section [231 adapted to suit Z v rather than Z. 
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For (p G C l (S 2 ) the analogue of (J25J) is 



E [^(/n) 2 ] = /y <p{x)«p(y)Kn(x,y)dxdy, 



S 2 xS 2 



where K n (x,y) is given again by ()26p . Since K(x,y) = K(cp), where <fi 
d(x,y), we may employ Fubini, to obtain (cf. ([351) 1 



7T 

E[Z^(f n ) 2 ] =2vr|5 2 | y ^ n (0)^(0)#, 
o 

where W v : [0, 7r] — > M is a continuously differentiable function defined by 

(82) w^) := JL y <p( x )<p( y )dxdy. 

d(x,y)=(j> 

For example, for the constant function </j = 1 we have 

W l (4>) = sin(0). 
It is easy to check that, since d(x, —y) = ir — d(x, y), we have 

(83) W^iir -</)) = W v (<f>), 

as we assume that tp is even. 

Scaling the integrand in the same manner exactly as in section 12. 5[ we 
finally obtain the following lemma (cf. Proposition 12. 7p . 

Lemma 4.1. The variance of Z^{f n ) is given by 

(84) Var(^(/„)) = 47r 2 -^^, 
where 

■wm 

I% = J (KM - ^ W(rl>/m)dfJ> 
o 

Remark 4.2. One deduces from ([50]) and ([83|) that 

nm/2 

(85) 11 = 2 J (K n {ij)--^W^^/m)d^. 

o 

We will need some rather simple properties of W^. Writing the double 
integral (|82p as an iterated integral and using the spherical coordinates with 
pole at x for each x S S 2 we obtain 

(86) W^) = g^sm($W2r(0), 
where 

«W) = J ,(e*p, (<J .„)>*, 

is a continuously differentiable function with 

(87) <(0) = 2tt|M|| 2(52) , 
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|2 



\Wq(4>)\ < 2ir\\tp\\ 00 \\tp\\ms') < 87r 
and derivative uniformly bounded by 

(89) W(0)| < 27r||^|| 00 ^(^). 



Now we pursue the proof of Theorem ll.4[ By (|84p . evaluating the variance 
of is equivalent to evaluating It, and for notational convenience we 
choose to work with the expression (|85p . As in the proof of Theorem 11.11 
we choose a constant C > 0, which remains fixed throughout the present 
section, and divide the interval [0, 7tto/2] = [0,C] U [C, -Km/2] (see section 

EH), 

We then have the following lemma (cf. Lemma l3.3|) : to prove it just use 
(|86|) and the bound ([88]) for Wq together with Lemma 13,21 

Lemma 4.3. For any constant C > 0, we have as n — > oo 



(90) 
where 

(91) 



It = lit + O 



llvll 



nm/2 



it 



dip. 



Proof of Theorem \l-4\ First we evaluate It as defined in (|9ip . Plugging 
(1561) mt o dSH), we have (cf. 



(92) 



■Km/2 



C 



1 sin(2V>) _65 1 9 cos(2^) 

2irnsm(ip/m) 256 7r 2 n sva.{tp / m)tp ^2 -nnip sm{ip / m 

/ 7rm/2 



+ 



^sin(2V>)-^cos(4V>) 



1 



7r 2 nip sin(^/m) 

7rm/2 



( — I + O 

m 



1 1 



m 



16vr 3 



77 



. 65 1 9 cos(2V>) 
sin(2V>) + ™— + — + 



27 sin(2^) 



128 



cos(4?/>) 



+ o 



c 

I II 2 1 
M oo- 

n 



128 vrV' 16 V 



7n0 



with with constants involved in the "O" -notation universal. Here we used 
the identity (|86|) : to effectively control the error term we use 
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We integrate by parts the first oscillatory term in (|92p . using the contin- 
uous differentiability assumptions; this yields the bound for its contribution 

■Km/2 

< i f sin(2^)W^(V>An)# 

Ttm/2 

cos(2ip)Wg'(ip/m)diJj 



n 

c 



«i|c s(2^).<(^/m)||5 =7rm/2 + l 



c 



« ^ + MWd « (ML + WUVM) . I 

n n n 

with constants involved in the " <C "-notation universal, by ()89p . It is easy 
to establish similar bounds for the remaining oscillatory terms in (j92|) i.e. 
the 3rd and the 4th terms. 

To analyze the main contribution, which comes from the remaining second 
term in ()92p . we note that the continuous differentiability of Wq implies 

W {<j>) = 2tt\\(p\\ 2 l2{s2} + 0\ Mgo y iv )(<f>), 

by ()87p and (J89j) . The main contribution to ()92p is then 

nm/2 Txm/2 

65 1 f Wgty/m) 65 2 1 f dtp 

2048^n y ^ W24^ MlL2 ( s2 )'n J ~ + U M->n<?) 

c c 



/ irm/2 \ 
1 



n 2 



( / 



65 n ||2 logra L/n /1\ 



All in all we evaluated In as 



n 



Plugging this into (fUU|) yields 

ta*\ rv 65 2 log re „ /1\ 

(93) 4 = 512^3 iiviix^cs 3 ) • — + ^- ) ■ 

We finally obtain the statement of Theorem 11.41 by plugging (|93p into (|84p . 

□ 

5. Proof of Theorem 11.51 

As implied by the formulation of Theorem 11.51 i n this section we will 
deal with functions of bounded variation. The definition and some basic 
properties of the class BV{S 2 ) of functions of bounded variation is given in 
Appendix O 

5.1. On the proof of Theorem 11.51 To prove Theorem 1 1 . 5 1 one wishes to 
apply a standard approximation argument, approximating our test function 
tp of bounded variation with a sequence (pi of C°°, for which we can apply 
Theorem II. 4[ There are two major issues with this approach however. 

On one hand, one needs to check that <pi approximating ip implies the 
corresponding statement for the random variables Z v (f n ) and Z Vi (f n ), and, 
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in particular, their variance. While it is easy to check that if <pi — > (p in L 1 
then for every fixed n we also have 

E[Z^(f n )]^E[Z^(f n )}, 

the analogous statement for the variance is much less trivial (see Proposition 



On the other hand, when applying Theorem 11.41 for ipi, one needs to 
control the error term in (|14l) . which may a priori depend on <pi. To resolve 
the latter we take advantagqjof the fact that Theorem 1 1 .41 allows us to control 
the dependency of the error term in (|14p on the test function in terms of 
its L°° norm and total variation. Thus to resolve this issue it would be 
sufficient to require from <pi to be essentially uniformly bounded and having 
uniformly bounded total variation. 

Fortunately the standard symmetric mollifiers construction from [13] as 
given in Appendix O satisfy both the requirements above. Namely given a 
function <p G BV(S 2 ) we obtain a sequence (fi of C°° function, that converge 
in L 1 to ip, Halloo < IMloo and in addition 

V(<pi) -> V(<p). 

5.2. Continuity of the distribution of Z^. As pointed in section [5TT| to 
prove Theorem 1 1.5 1 we will need to show that the distribution of Z^ depends 
continuously on ip. Proposition 15.11 makes this statement precise. We believe 
that it is of independent interest. 

Proposition 5.1. Let ip G BV(S 2 ) nL OD (S 2 ) be any test function. Then 
(94) E [Z*(f n ) 2 ] = O (n 2 |M|i 1(iS2) + IMUIMk^ 

where the constant involved in the "O" -notation are universal. In particular, 
if F C S 2 has a C 2 boundary then 



E 



Z F Un)f] =0{n 2 \F\ 2 + \F\). 



Proof. Recall that we defined W v as (f52"j) ; the assumption ip G L°°(S 2 ) saves 
us from dealing with the validity of this definition. Starting from (|121|) . and 
repeating the steps in proof of Lemma 12.11 from either [23J or [HE], we 
may extend the validity of the Kac-Rice formula (|84|) with (|85|) for this 
class as well. Note that the constant term in (|85p comes from the squared 
expectation, so that we need to omit it if we want to compute the second 
moment. We then have 



(95) E 0Z*(/n)) 



n + 1/2 



Proposition 15. ll gives a stronger claim. First, it evaluates the second moment rather 
than the variance. Secondly, it gives a general bound for E [(Z Vi (/„) — Z v (f n )) 2 ] = 

b[(z«"-*' (/ B )) 2 ]. It is easy to derive the result we need employing the triangle inequality. 

^This is by no means a lucky coincidence; it is precisely the proof of Theorem 1 1 . 5 1 that 
motivated the technical statement made in Theorem 11.41 
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■Km/2 



K n (V) W v ( — ) d<j>, 
m 



denoting as usual m := n + 1/2. 

As usual while estimating this kind of integrals we remove the origin by 
choosing a constant C > and writing 

(96) = J^i + J n> 2, 

where 

c 

= f K n {i>)W*(^\ # . 



and 



-Km/2 



J n,2 



K n ( — ) #. 

m 



First, for C < V < ^r> K n {ip) is bounded by a constant, which may 
depend only on C i.e. 

l^nWI = C (1), 

which follows directly from Proposition 13.51 Therefore we may bound J^ 2 

as 

■Km/2 Km/2 



/ \w (£)|#< / 



(97) 



c 

tt/2 



m / |^(^)|#<n|| ¥3 ||i 1(52)) 



o 

as earlier. 

We claim that for < ip < C we may bound if n as 

1 



(98) 



\K n ^)\=Oc 



Before proving this estimate we will show how it helps us to bound J^\- We 
have by the definition of 



C 



(99) 



1 

o 

C/n 



c 



dip < 



n 



1 m 



dtp 



< / \ W l |0)I#<C -IMUIMIlIOS 2 ); 



by ([86]) and the first inequality of 

The statement of the present lemma now follows from plugging the esti- 
mates ([97]) and ([99} into ([96]) and ([95]). We still have to prove ([98]) though. 
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To see (f98|) we use Remark 12.101 and the Cauchy-Schwartz inequality to 
write 

(100) KM = 1 E [||LT|| • \\V\\] , 

(27r)y 1 - P n {cosil)/mY 

where f7 and V are 2-dimensional mean zero Gaussian vectors with covari- 
ance matrix (|45p . whose entries uniformly bounded by an absolute constant, 
whence 



(101) e [\\u\\ ■ \\v\w < ^/nwmmww 2 ] = oa), 

with the constant involved in the "O" -notation uniform. For the other term 
Lemma IB.2I yields 

(102) y/l - P n (cos(^/m)) 2 « I 

so that we obtain the necessary bound ([98]) for K n (tp) plugging the estimates 
(flUTD and {MJ into (fTUUj) . 

□ 

5.3. Proof of Theorem 11.51 Now we are ready to give a proof of Theorem 

Proof of Theorem \1.5[ Given a function tp E BV(S 2 ), let ipi G C 00 be a 
sequence of smooth functions such that tpi — > <p in L 1 (5 2 ), 

^ := T/(^) -> 

and 

(103) Halloo < IMloo- 

(see Appendix EJ. Let M 1 := \\ 

<p\\oc an d 
M 2 := max{l/j}j>i < oo, 

since Vi is convergent. 

Theorem 11.41 applied on ipi G C°°(5 2 ) states that 

(104) Var(^(/ n )) = c(^) • logn + Ml , Ma (l), 
where c(cpi) is given by 

C( ^ } := 65 ^28^ > °" 

Note that since ipi and </? are uniformly bounded (|103p . L x (5 2 ) convergence 
implies L 2 (S 2 ) convergence, so that 

(105) c(<pi) -> cfa), 

the latter being given by (fl~3j) . 

On the other hand we know from Proposition 15.11 that 

,2" 



E 



(#«(/„) -Z*(/ n )) 2 =E 



0, 



using the uniform boundedness (|103p again to ensure that ([M]) holds uni- 
formly. This together with the triangle inequality implies that 

(106) Var (Z^(f n )) Var (^(/„)) , 
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and we take the limit i — > oo in (|104p to finally obtain the main statement 
of Theorem 11.51 

□ 

Remark 5.2. From the proof presented, it is easy to see that the constant 
in the "O" -notation in the statement (|14p of Theorem 11.51 could be made 
dependent only on 1 1 y 1 1 oo an d V(tp). 

Appendix A. Computation of the covariance matrix 

In this section we compute the matrix Q n (<f>) explicitly, as prescribed by 
(|37|) . The matrix fi n (</>) is the 4x4 covariance matrix of the mean zero 
Gaussian random vector Z2 in ([23p with x 7^ y £ S 2 any two points on the 
arc {9 = 0} with d(x,y) = <fi, conditioned upon f(x) = f(y) = 0. Recall 
that as such, Q n (4>) is given by (f34"|) . where A = A n (x,y), B = B n (x,y) and 
C = C n (x,y) are given by ([30j) . ([31]) and ([32]) respectively, and x,y G S 2 
are any points on the arc {8 = 0} with d{x, y) = (f). Here the gradients are 
given in the orthonormal frame (|36p of the tangent planes T X (S 2 ) associated 
to the spherical coordinates (see section [2~4l for explanation). 

Let x and y correspond to the spherical coordinates (<t>x,0 x = 0) and 
(4>y,6 y = 0), and denote 

4> = d{x,y) = \<f) x - 4>y\. 

Recall that 

u n (x,y) = P n (cos(d(x,y))) = P n (cos0). 
First we compute the inverse of A in (|30p as 

It is easy to either see from the geometric picture or compute explicitly 
that 

(108) V x u n {x,y) = -V y u n {x,y) = ±P^(cos <£) sim>)(l, 0), 
depending on whether (f> x > 4> y or (f> x < <fi y , so that 

(109) B n {<j>) = ±( Dlf ° . ° ^(cos0) S in0 0\ 

Next we turn to the missing part of C n (<p) defined in (|3"2j) . i.e. the "pseudo- 
Hessian" H n (4>) given by (f3"3"|h By the chain rule 

#"„(</>) = (Vje <8> V„) u n (x, y)=V x ® P' n {cos{d{x, y)))V y cos(d(x, y)) 

(110) = P"(cos (j))V x cos(d(x, y)) ® Vj, cos(d(x, y)) 
+ P' n {cos<j)) (V x ®Vy)cos(d(x,y)). 

We denote 

h(x, y) := cos <i(a;, y) = cos <^> x cos (f> y + sin^ sin^y cos(#a; — 9 y ), 
and compute explicitly that for X = y = we have 

(111) (V* ® V„) cos(d(x, y)) = (Va, V„) y) = 



COS ( 
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Plugging ([TPS]) and (fTTT]) into (fTTO]) we obtain 

, * /P^(cos(/>)cos0-P^(cos</>)sin(0) 2 

( 112 ) F= V J* (cos* 

Finally plugging (| 11 2 () into ([32]) . and plugging that together with (|107|) and 
()109|) into we obtain an explicit expression for £l n (<fi) as prescribed by 
(1371) with entries given bv fl55J), (i39l) and (gDJ) . 



Appendix B. Estimates for the Legendre polynomials and 

RELATED FUNCTIONS 

The goal of this section is to give a brief introduction to the Legendre 
polynomials P n : [—1,1] —> R and give some relevant basic information 
necessary for the purposes of the present paper. The high degree asymptotic 
analysis of behaviour of P n and its first two derivatives involves the Hilb's 
asymptotics in Lemma IB. II together with the recursion (|114|) for the 1st 
derivative and the differential equation (|113|) for the second one. We refer 
the reader to |17j for more information. 

The Legendre polynomials P n are defined as the unique polynomials of 
degree n orthogonal w.r.t. the constant weight function u(t) = 1 on [—1, 1] 
with the normalization P n (l) = 1- They satisfy the following second order 
differential equation: 

(113) 

P^cos^/m)) = - n(n + 1 | 2 P n (co S (^/m)) + ^^ ^(cos^/m)), 
sm{ip/in) z sm{yj/m) z 

as well as the recursion 
(114) 

Ti 

P^(cos(V> / m)) = (P n -i (cos(V> / m)) — cos(ip/m)P n (cos(i/j / m))) 



sm(ip / m) 2 

The Hilb asymptotics gives the high degree asymptotic behaviour of P n . 
Lemma B.l (Hilb Asymptotics (formula (8.21.17) on page 197 of Szego |17|)). 

/ i \ 1/2 

(115) P n {cos<t>)=[-^-) J Q {{n + l/2)<t>)- 



sm< 



uniformly for < (j) < it/2, Jq is the Bessel J function of order and the 
error term is 



m « 



V /2 0(n~ 3/2 ), Cn' 1 <4><-k/2 
<f) a+2 0(n a ), 0<<p< Cn' 1 , 



where C > is any constant and the constants involved in the "O" -notation 
depend on C only. 

We have the following rough estimate for the behaviour of the Legendre 
polynomials at ±1, which follows directly from Hilb's asymptotic. 

Lemma B.2. For < <p < ^ one has 

1 - P n (cos(0)) 2 > n 2 2 , 
where the constant in the "^"-notation is universal. 
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Lemma B.3. The Legendre polynomials P n and its couple of derivatives 
satisfy uniformly for n > 1, ip > C: 

(1) 



, IT 1 COs(^ 

P n {cos(ip/m)) = a sm(V> + 



Trn sm(ip / m) \ 4 8 ip 

+oi4.+ 1 



1p 5 /2 \p\f)\ 



n 



(116) 
(2) 



(117) V ^ sin^/V 2 V [nWm) ^ ^ " 3 + t ^ + \ 
n 2 n 



Pn(cos{ip/m)) 



^7/2 ^3/2 



(3) 
(118) 



n 2 „ „ , , , „ s 2 „, „ , , , ^ . „ / n 3 



P"(cos(?/>/m)) = — — — P n (cos(VVm))+ . , . . 2 i* (cos(^/ro))+0 . 

Proof. By Lemma fB.ll and the standard asymptotics for the Bessel functions 
we obtain 



y / sm(^/m) » 



2 fsin(V> + f) lcos(^ + f)^ Q M ,^ 



t y/sm(\f/m) V 8 V^ 2 / VV' 5//2 ' ™ 2 



2 , tt n lcos(i/> + §)\ 1 1 



sin(V + -) - , 4/ + O -— + 



y Tin sm.{ip/m) \ 4 8-0/ \?/; 5 / 2 v^ 77 - 

which is (|116p . 

To obtain ()117p we employ the recursive formula (|114p . evaluating the 
Legendre polynomials appearing there using (|116p . Finally we obtain a 
simple approximate differential equation (|118p . replacing n(n + 1) by n 2 and 
cos(ip/m) by 1 in the differential equation (|113|) satisfied by the Legendre 
polynomials. To do so we use the decay 

|P n (cos(V/m))| = O U= 

of P n , which follows directly from (|116p . as well as (|79p of its derivative. 

□ 



Appendix C. Functions of bounded variation 

In this section we give the definition and some basic properties on the 
functions of bounded variation. For more information we refer the reader 
to H3]. 
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Classically, the variation of a function rj : [a, b] — )■ M on [a, x] is defined as 

k-i 

V(rj;x):= sup J2\v(ti+i) ~ 



i=l 



where the supremum is over all the partitions A of [a, a;]. We denote I := 
[a, b]. If rj G C l (I) then the variation is 

X 

V{ m x) = [ \r,'(t)\dt. 



In fact, the last inequality holds even for rj G W 1 ' (I), where for this class 
of functions the derivative rj' is the weak derivative. 

This definition has two major disadvantages. First, one wishes to identify 
functions 

(119) 7/1 ~ r/2, if 7/i (x) = f]2(x) for almost all x G /. 

However, altering the values of rj on a measure zero set does impact its 
variation. Secondly, one cannot extend this definition for the multivariate 
case. 

We then need to find a better definition. Fortunately, the following defi- 
nition eliminates the disadvantages of the previous one. Let 

X 

V(t];x) := sup / rj(t)g'(t)dt, 

9 J 
o 

where the supremum is over all the continuously differentiable functions 
g : [a, x] — > E with \g(t)\ < 1 for all t G [a, a;]. The number V(rj) := V(r];I) 
is called the total variation of 7/ on /. We define the space BV(I) to be the 
equivalence classes of functions 7/ with finite total variation, i.e. 

BV(I) := {r, G L l {I) : V{n) < oo}/ ~, 

where the equivalence relation is given by (|119p . It is known [13] that 

W X '\I) C BV(I). 

We may extend the latter definition quite naturally for the multivariate 
case. Of our interest is the case of the sphere. Let if G L l {S 2 ) be an 
integrable function. We define its variation on an open subset Q C S 2 as 

V(ip; Q) := sup / (f(x) div g{x)dx, 

9 J 

n 

where the supremum is over the continuously differentiable compactly sup- 
ported vector fields 

gec^^rn) 

with \g(x)\ < 1 for all x G 0. We define the total variation as 
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The space BV(Q) is defined as the equivalence class of functions tp with 
V(tp) < oo, with the equivalence relation (|119p adapted to the sphere. Again, 
for a smooth (and W 1,1 (S 2 )) function tp 6 C l (S 2 ) we have 



V(<p) = J \\Vtp(x)\\dx, 

and 

W 1,1 ^) C BV{p). 

For a function </? G £?V(5 2 ) [13], Theorem 1.17 gives a constructior|]of a 
sequence ^ G C°° of smooth test functions such that tpi — > tp in L 1 (5 2 ) as 
well as 

Moreover, part (b) of that theorem implies that 

Halloo < IMloo- 

We are interested in the linear statistics of the nodal sets of smooth func- 
tions, where the test functions are of bounded variation. The definition of 
the linear statistics is natural for continuous test functions <p : S 2 — > R as 



Z*(f)= J tp(x)dx, 
/-Ho) 

i.e. integrating the restriction of tp on the nodal line. However, things 
become more complicated as one drops the continuity assumption; since the 
values of tp e BV{S 2 ) (or tp G L l (S 2 )) are only defined up to measure zero 
sets, there is no meaning to restricting tp on curves. In general, one cannot 
define linear statistics corresponding to integrable functions, and to define a 
notion of trace of tp on a smooth curve C, we will have to exploit the values 
of tp in a tubular neighbourhood around C . Such a construction is known for 
the functions belonging to the class W > (S 2 ), i.e. for every smooth curve 
C C $7 there exists a map 

trc : W l ' l (n) -»• L l {C) 

satisfying the natural properties. 

The situation is more involved in the -B^-case, which is essential to us, 
since W 1,1 does not contain the characteristic functions of nice spherical 
subsets. A smooth curve divides the sphere and a tubular neighbourhood 
around it into two parts. One may then define |13| . chapter 2, two traces 
tp + = trj tp and tp~ = tr^ tp both belonging to L 1 (C), corresponding to the 
values of tp on the different parts. The traces tp + and tp~ may in general be 
different! 10 !, and moreover, one cannot canonically distinguish between the 
traces. For instance, if F C S 2 is a nice subset, and xf is its characteristic 
function, then tvQp(xF) might be defined as either 1 or 0, depending on 



This book gives only the theory of functions of bounded variation on R m . One can 
obtain a similar theory for the sphere only slightly modifying the one given. 
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whether we approach the circle from inside or outside the disc respectively. 
Accordingly, the corresponding linear statistic might be \en(dF) or 0. 
We define the average trace of (p on a smooth curve C C S 2 as 

(120) ^ -=\p + + 

and this is the notion that appears in the formulation of Theorem 11.51 and 
throughout the present paper. For ip E L°°(5 2 ) we have 

Halloo < IMIoo- 

Following the approach of [13] . (2.10) and Federer's co-area formula |11| . 
one may obtain the inequality 



cp(x)dx — J (f • (x)dx 
f-Ht) /-i(o) 



dt 



= O f (V(^r 1 ((0,e)))+ sup llena- 1 ^)-^/- 1 ^))! 

V 0<t<e 

As (3 — >• 0, the right hand side of the last inequality vanishes. Therefore we 
have the following Kac-Rice type formula 

<f + (x)dx = lim - I \\X7 f(x)\\ip(x)dx, 



e^o e 

/-1(0) 0<f(x)<e 



and similarly 



y3 [x)dx = lim - / ||V/(x)||<^(x)(ix. 



/-i(0) -e</(x)<0 

Combining the last two formulas we obtain 

(121) J <p ± (x)dx = \im± J \\Vf(x)Mx)dx. 

/-Ho) |/(x)[<6 

We employ ()12ip to extend the validity of the Kac-Rice formula for the 
second moment for (p € BV(S 2 ) (see f[95|l ). 
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